Learning to Evaluate Performance of Multimodal Semantic Localization

نویسندگان

چکیده

Semantic localization (SeLo) refers to the task of obtaining most relevant locations in large-scale remote sensing (RS) images using semantic information such as text. As an emerging based on cross-modal retrieval, SeLo achieves semantic-level retrieval with only caption-level annotation, which demonstrates its great potential unifying downstream tasks. Although has been carried out successively, but there is currently no work systematically explores and analyzes this urgent direction. In paper, we thoroughly study field provide a complete benchmark terms metrics testdata advance task. Firstly, characteristics task, propose multiple discriminative evaluation quantify performance The devised significant area proportion, attention shift distance, discrete distance are utilized evaluate generated map from pixel-level region-level. Next, standard data for contribute diverse, multi-semantic, multi-objective Localization Testset (AIR-SLT). AIR-SLT consists 22 RS 59 test cases different semantics, aims comprehensive evaluations models. Finally, analyze models detail, explore impact variables We have also established new paradigm referring expression comprehension, demonstrated advantage semantics through combining it tasks detection road extraction. proposed metrics, testsets, corresponding scripts open access at https://github.com/xiaoyuan1996/SemanticLocalizationMetrics.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

the norms of localization in translating persian multimodal texts: the case of videogame demos

abstract هنجارهای بومی سازی در ترجمه متون چندوجهی فارسی:مورد دموهای بازیهای کامپیوتری چکیده اهداف عمده مطالعه حاضر به سه دسته تقسیم میشوند: 1) بررسی مشکلات احتمالی ترجمه دموهای (فیلمهای) بازیهای کامپیوتری،2) تعیین هنجارهای بومی سازی در ترجمه دموهای (فیلمهای) بازیهای کامپیوتری و 3) تعیین ایدئولوژیهایی که این هنجارها در جامعه نشان میدهند. به این منظور، ابتدا، مجموعه ای ازدموهای (فیلمهای) ب...

15 صفحه اول

on the comparison of keyword and semantic-context methods of learning new vocabulary meaning

the rationale behind the present study is that particular learning strategies produce more effective results when applied together. the present study tried to investigate the efficiency of the semantic-context strategy alone with a technique called, keyword method. to clarify the point, the current study seeked to find answer to the following question: are the keyword and semantic-context metho...

15 صفحه اول

Multimodal Semantic Learning from Child-Directed Input

Children learn the meaning of words by being exposed to perceptually rich situations (linguistic discourse, visual scenes, etc). Current computational learning models typically simulate these rich situations through impoverished symbolic approximations. In this work, we present a distributed word learning model that operates on child-directed speech paired with realistic visual scenes. The mode...

متن کامل

a frame semantic approach to the study of translating cultural scripts in salingers franny and zooey

the frame semantic theory is a nascent approach in the area of translation studies which goes beyond the linguistic barriers and helps us to incorporate cognitive and cultural factors to the study of translation. based on rojos analytical model (2002b), which centered in the frames or knowledge structures activated in the text, the present research explores the various translation problems that...

15 صفحه اول

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Geoscience and Remote Sensing

سال: 2022

ISSN: ['0196-2892', '1558-0644']

DOI: https://doi.org/10.1109/tgrs.2022.3207171